Dynamic Documents Indexing in Evolving Contexts
نویسندگان
چکیده
In this paper we consider similarity between documents in contexts defined by ontologies. Two documents have different similarity degree depending on the context in which they are considered. We consider a scenario where context evolves incrementally, necessitating continual revision of similarity measures between documents. We develop algorithms that revise similarity between documents concurrently with operations updating the ontology.
منابع مشابه
Efficient semantic indexing via neural networks with dynamic supervised feedback
We describe a portable system for e cient semantic indexing of documents via neural networks with dynamic supervised feedback. We initially represent each document as a modified TF-IDF sparse vector and then apply a learned mapping to a compact embedding space. This mapping is produced by a shallow neural network which learns a latent representation for the textual graph linking words to nearby...
متن کاملExemplary documents: a foundation for information retrieval design
Documents are generally represented for retrieval by either extracting index terms from them or by creating and selecting from an external set of candidate terms. There are many procedures for doing this, but while work continues along these dimensions, there have been relatively few attempts to change this basic process. Of particular importance is the creation of indexing schemes for retrieva...
متن کاملEfficient Dynamic Indexing and Retrieval of XML Documents using Three- Dimensional Quasi-BitCube
XML is a new standard for exchanging and representing data on the Internet. Techniques for indexing and retrieval of XML data is drawing increasing attention since they enable one to access certain parts of retrieved documents easily. However, they provide little or no support for adding new documents to an existing document collection, requiring instead that the entire collection be re-indexed...
متن کاملWordSieve: A Method for Real-Time Context Extraction
In order to be useful, intelligent information retrieval agents must provide their users with context-relevant information. This paper presents WordSieve, an algorithm for automatically extracting information about the context in which documents are consulted during web browsing. Using information extracted from the stream of documents consulted by the user, WordSieve automatically builds conte...
متن کاملContext based Web Indexing for Storage of Relevant Web Pages
A focused crawler downloads web pages that are relevant to a user specified topic. The downloaded documents are indexed with a view to optimize speed and performance in finding relevant documents for a search query at the search engine side. However, the information will be more relevant if the context of the topic is also made available to the retrieval system. This paper proposes a technique ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005